<?xml version="1.0" encoding="UTF-8"?><article article-type="normal" xml:lang="en">
   <front>
      <journal-meta>
         <journal-id journal-id-type="publisher-id">PALEVO</journal-id>
         <issn>1631-0683</issn>
         <publisher>
            <publisher-name>Elsevier</publisher-name>
         </publisher>
      </journal-meta>
      <article-meta>
         <article-id pub-id-type="pii">S1631-0683(13)00107-3</article-id>
         <article-id pub-id-type="doi">10.1016/j.crpv.2013.07.001</article-id>
         <article-categories>
            <subj-group subj-group-type="type">
               <subject>Research article</subject>
            </subj-group>
            <subj-group subj-group-type="heading">
               <subject>General palaeontology, systematics and evolution (Phylogenetic analysis)</subject>
            </subj-group>
         </article-categories>
         <title-group>
            <article-title>Evaluating strategies of phylogenetic analyses by the coherence of their results</article-title>
            <trans-title-group xml:lang="fr">
               <trans-title>Évaluation des stratégies d'analyse phylogénétique par la cohérence de leurs résultats</trans-title>
            </trans-title-group>
         </title-group>
         <contrib-group content-type="editors">
            <contrib contrib-type="editor">
               <name>
                  <surname>Laurin</surname>
                  <given-names>Michel</given-names>
               </name>
               <email/>
            </contrib>
         </contrib-group>
         <contrib-group content-type="authors">
            <contrib contrib-type="author">
               <name>
                  <surname>Li</surname>
                  <given-names>Blaise</given-names>
               </name>
               <email>blaise.li@normalesup.org</email>
            </contrib>
            <aff-alternatives id="aff0005">
               <aff> Centro de Ciências do Mar, Universidade do Algarve, Campus de Gambelas, 8005-139 Faro, Portugal</aff>
               <aff>
                  <institution>Centro de Ciências do Mar, Universidade do Algarve</institution>
                  <addr-line>Campus de Gambelas</addr-line>
                  <city>Faro</city>
                  <postal-code>8005-139</postal-code>
                  <country>Portugal</country>
               </aff>
            </aff-alternatives>
         </contrib-group>
         <pub-date-not-available/>
         <volume>12</volume>
         <issue seq="6">6</issue>
         <issue-id pub-id-type="pii">S1631-0683(13)X0007-7</issue-id>
         <issue-title>Systematics beyond Phylogenetics / La systématique au-delà de la phylogénétique</issue-title>
         <fpage seq="0" content-type="normal">381</fpage>
         <lpage content-type="normal">387</lpage>
         <history>
            <date date-type="received" iso-8601-date="2013-04-30"/>
            <date date-type="accepted" iso-8601-date="2013-07-04"/>
         </history>
         <permissions>
            <copyright-statement>© 2013 Académie des sciences. Published by Elsevier B.V. All rights reserved.</copyright-statement>
            <copyright-year>2013</copyright-year>
            <copyright-holder>Académie des sciences</copyright-holder>
         </permissions>
         <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="application/pdf" xlink:href="main.pdf">
                        Full (PDF)
                    </self-uri>
         <abstract abstract-type="author">
            <p id="spar0005">I propose an approach to identify, among several strategies of phylogenetic analysis, those producing the most accurate results. This approach is based on the hypothesis that the more a result is reproduced from independent data, the more it reflects the historical signal common to the analysed data. Under this hypothesis, the capacity of an analytical strategy to extract historical signal should correlate positively with the coherence of the obtained results. I apply this approach to a series of analyses on empirical data, basing the coherence measure on the Robinson–Foulds distances between the obtained trees. At first approximation, the analytical strategies most suitable for the data produce the most coherent results. However, risks of false positives and false negatives are identified, which are difficult to rule out.</p>
         </abstract>
         <trans-abstract abstract-type="author" xml:lang="fr">
            <p id="spar0010">Je propose une approche pour identifier, parmi plusieurs stratégies d’analyse phylogénétique, celles aux résultats les plus fiables. Cette approche se base sur l’hypothèse que, plus un résultat est reproduit à partir de données indépendantes, plus il reflète le signal historique commun aux données analysées. Sous cette hypothèse, la capacité d’une stratégie d’analyse à extraire le signal historique devrait être positivement corrélée à la cohérence des résultats obtenus. J’applique cette approche à une série d’analyses sur des données empiriques, en basant la mesure de cohérence sur les distances de Robinson–Foulds entre les arbres obtenus. En première approximation, les stratégies d’analyse les plus adaptées aux données produisent les résultats les plus cohérents. Cependant, des risques de faux positifs et de faux négatifs, difficiles à écarter, sont identifiés.</p>
         </trans-abstract>
         <kwd-group xml:lang="en">
            <unstructured-kwd-group>Chloroplasts, Coherence, Cyanobacteria, Methods, Phylogeny</unstructured-kwd-group>
         </kwd-group>
         <kwd-group xml:lang="fr">
            <unstructured-kwd-group>Chloroplastes, Cohérence, Cyanobactéries, Méthodes, Phylogénie</unstructured-kwd-group>
         </kwd-group>
      </article-meta>
   </front>
   <body>
      <sec id="sec0005">
         <label>1</label>
         <title id="sect0025">Introduction</title>
         <p id="p0005">An important breakthrough for molecular phylogeny reconstruction has been made with the introduction of probabilistic approaches (<xref rid="bib0035" ref-type="bibr">Felsenstein, 1981</xref> and <xref rid="bib0150" ref-type="bibr">Yang and Rannala, 1997</xref>), directly and explicitly using molecular evolution models. This usually reduces the occurrences of reconstruction artifacts, in particular in studies at large evolutionary scales (but see <xref rid="bib0135" ref-type="bibr">Simmons, 2012</xref>). In parallel with an increased availability of data (which permits a better estimation of the parameters of complex models) and computational power (which permits the exploration and evaluation of a large number of possible trees), the development of probabilistic methods was accompanied with the development of models that take into account an increasing number of aspects of molecular evolution such as evolutionary rate (<xref rid="bib0145" ref-type="bibr">Yang, 1993</xref>) or composition (<xref rid="bib0050" ref-type="bibr">Foster, 2004</xref> and <xref rid="bib0075" ref-type="bibr">Lartillot and Philippe, 2004</xref>) heterogeneities. The accuracy of phylogenies can also be enhanced by using character selection or recoding techniques (<xref rid="bib0010" ref-type="bibr">Brinkmann and Philippe, 1999</xref>, <xref rid="bib0055" ref-type="bibr">Goremykin et al., 2010</xref>, <xref rid="bib0060" ref-type="bibr">Hassanin et al., 2005</xref>, <xref rid="bib0065" ref-type="bibr">Inagaki et al., 2004</xref> and <xref rid="bib0130" ref-type="bibr">Roure and Philippe, 2011</xref>).</p>
         <p id="p0010">However, the diversity of methods and models available makes it difficult to decide which strategy to adopt when trying to reconstruct a phylogeny. Some methods are available to help the phylogeneticist in this choice. For instance, programs like jModelTest (<xref rid="bib0115" ref-type="bibr">Posada, 2008</xref>) use a variety of criteria to select a model achieving a good compromise between realism and tractability. But such readily available tools are limited to the set of models implemented in the phylogeny programs on which they rely. It is also common practice to compare phylogenies obtained using different models by applying selection criteria identical to those used in <italic>a posteriori</italic> model selection programs, which extends these selection approaches to arbitrary models. Still, the model is only one aspect of the analytical strategy: Data selection or recoding techniques also need to be chosen prior to the tree construction, a program and its specific settings have to be chosen, and support evaluation procedures can take diverse forms. All of these aspects form the analytical strategy that leads from the raw data to an annotated tree ready for drawing phylogenetic conclusions.</p>
         <p id="p0015">An approach suitable for the choice of such integrated analytical strategies could be to make the choice <italic>a posteriori</italic>, based on their results. A variety of analyses would be performed, and the ones producing the most accurate results would be chosen. This immediately raises the question as to how to evaluate the accuracy of a phylogeny reconstruction. Measures such as bootstrap proportions (<xref rid="bib0040" ref-type="bibr">Felsenstein, 1985</xref>) or Bayesian posterior probabilities are sometimes regarded as reliability indicators, but they must be interpreted in the limited context of the particular dataset that has been analysed. Other datasets may yield different support values (or even contradictory results) and these values do not correlate perfectly with one another (<xref rid="bib0030" ref-type="bibr">Douady et al., 2003</xref>). Reliability of phylogenetic relationships is arguably better estimated when considering trees obtained from several independent datasets, and examining the degree to which the results are reproduced across these datasets (<xref rid="bib0015" ref-type="bibr">Chen et al., 2003</xref>, <xref rid="bib0025" ref-type="bibr">Dettai and Lecointre, 2004</xref>, <xref rid="bib0085" ref-type="bibr">Li and Lecointre, 2009</xref> and <xref rid="bib0100" ref-type="bibr">Miyamoto and Fitch, 1995</xref>). In this context, it has been observed that the reproducibility of the results was higher when a better modelling of the data was used (<xref rid="bib0105" ref-type="bibr">Miyamoto et al., 1994</xref>). This justifies a widespread practice consisting in using more complex models and methods when the phylogeny appears more challenging to resolve. This also suggests that result coherence could indeed correlate positively with accuracy.</p>
         <p id="p0020">The purpose of the present article is to report an attempt to use the <italic>a posteriori</italic> approach for selecting strategies of phylogenetic analyses using the reproducibility of the results as a criterion, and to discuss some potential pitfalls of such an approach.</p>
      </sec>
      <sec id="sec0010">
         <label>2</label>
         <title id="sect0030">Materials and methods</title>
         <sec id="sec0015">
            <label>2.1</label>
            <title id="sect0035">Test data</title>
            <sec>
               <p id="p0025">The <italic>a posteriori</italic> approach was tested on empirical multi-gene data assembled in the ambit of a yet-to-be-published work on the phylogeny of Cyanobacteria and plastids (<xref rid="bib0090" ref-type="bibr">Li et al., in preparation</xref>). Given the large evolutionary scale, as well as the potential existence of horizontal gene transfers, such a dataset should provide enough reconstruction challenge so that different analytical strategies will have different reconstruction accuracies, and show various degrees of result coherence.</p>
            </sec>
            <sec>
               <p id="p0030">The data consists of 73 protein-coding genes from 42 Cyanobacteria, plastids or nuclear genes of plastidial origin. The genes were grouped in 4 sets that were considered internally congruent and between them incongruent by the concaterpillar program (<xref rid="bib0080" ref-type="bibr">Leigh et al., 2008</xref>). This program performs a series of likelihood ratio tests under a GTR + I + Γ model, to evaluate whether datasets can be forced to share topologies and branch lengths or if separate trees provide a significantly better likelihood. Results of maximum likelihood analyses under a GTR + I + Γ model should therefore provide a reference situation where some incoherence effectively appears between the datasets. More accurate strategies than maximum likelihood analysis under a GTR + I + Γ model might be able to recover more of the history common to all datasets, for each one of them, and therefore be characterised by a higher coherence in the results.</p>
            </sec>
         </sec>
         <sec id="sec0020">
            <label>2.2</label>
            <title id="sect0040">Analytical strategies tested</title>
            <sec>
               <p id="p0035">For each of the 4 combined datasets, a series of various analytical strategies were applied. A name is associated with each of them to facilitate reporting and discussion of the results.</p>
            </sec>
            <sec>
               <p id="p0040">Maximum likelihood bootstrap analyses were conducted using RAxML versions 7.0.4 and 7.3.4 (<xref rid="bib0140" ref-type="bibr">Stamatakis, 2006</xref>) under a GTR + I + Γ model, with 200 pseudo-replicates of the data. For these analyses, the original data matrices were used, their amino-acid translations (for which a CPREV + I + Γ model was used) as well as some versions of these matrices where diverse combinations of sites were subjected to codon-degeneracy recodings.</p>
            </sec>
            <sec>
               <p id="p0045">A codon-degeneracy recoding is based on the replacement of codons by degenerate versions that represent all codons coding the same amino-acid. Nucleotides are replaced by IUPAC ambiguity codes at codon positions where several codons for the same amino-acid differ.</p>
            </sec>
            <sec>
               <p id="p0050">The goal of these recodings is to eliminate potentially misleading signal. The signal considered for removal corresponds to sites involved in codon synonymy. Due to the relaxed selection on the nucleotide at such sites, convergence between sequences sharing the same bias in their genome's nucleotide composition may have happened and mislead phylogenetic reconstruction (see for instance <xref rid="bib0020" ref-type="bibr">Cox et al., 2008</xref>, <xref rid="bib0050" ref-type="bibr">Foster, 2004</xref>, <xref rid="bib0060" ref-type="bibr">Hassanin et al., 2005</xref>, <xref rid="bib0110" ref-type="bibr">Nabholz et al., 2011</xref> and <xref rid="bib0125" ref-type="bibr">Rota-Stabelli et al., 2013</xref>). The most useful of the recodings should affect mostly sites where the proportion of misleading signal is the highest, and eliminate only a small proportion of the historical signal. A higher coherence of the results is expected for such recodings.</p>
            </sec>
            <sec>
               <p id="p0055">For reasons pertaining to the organization of the genetic code (that will not be detailed here), three categories of codon positions were distinguished: third codon positions of any amino-acid, leucine and arginine first codon positions, serine first and second codon positions.</p>
            </sec>
            <sec>
               <p id="p0060">The maximum likelihood analytical strategies were the following (see also supplementary document available at <ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://dx.doi.org/10.6084/m9.figshare.732758">http://dx.doi.org/10.6084/m9.figshare.732758</ext-link> for the recodings):<list>
                     <list-item id="listitem0005">
                        <label>-</label>
                        <p id="p0065">‘unrecoded’: the original matrix is used, without codon-degeneracy recoding. This strategy is used as a reference, where some degree of incoherence is expected between the datasets delimited by concaterpillar;</p>
                     </list-item>
                     <list-item id="listitem0010">
                        <label>-</label>
                        <p id="p0070">‘degen3’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but only the third position is actually recoded;</p>
                     </list-item>
                     <list-item id="listitem0015">
                        <label>-</label>
                        <p id="p0075">‘degen1LR’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but only at first positions of codons coding a leucine or an arginine;</p>
                     </list-item>
                     <list-item id="listitem0020">
                        <label>-</label>
                        <p id="p0080">‘degen12S’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but only at first and second positions of codons coding for a serine;</p>
                     </list-item>
                     <list-item id="listitem0025">
                        <label>-</label>
                        <p id="p0085">‘degen12LRS’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but only at first and second positions of codons coding for a leucine, an arginine or a serine;</p>
                     </list-item>
                     <list-item id="listitem0030">
                        <label>-</label>
                        <p id="p0090">‘degenLR3’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but if the codon codes anything else than a leucine or an arginine, only the third position is actually recoded;</p>
                     </list-item>
                     <list-item id="listitem0035">
                        <label>-</label>
                        <p id="p0095">‘degenS3’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but if the codon codes anything else than a serine, only the third position is actually recoded;</p>
                     </list-item>
                     <list-item id="listitem0040">
                        <label>-</label>
                        <p id="p0100">‘degenLRS3’: the codons are replaced with degenerate versions representing all synonymous codons in their family, but if the codon codes anything other than a leucine, an arginine or a serine, only the third position is actually recoded (since only leucine, arginine and serine are affected by codon synonymy at other positions than the third one, this actually amounts to replacing every (non-terminating) codon by the degenerate version representing its entire family);</p>
                     </list-item>
                     <list-item id="listitem0045">
                        <label>-</label>
                        <p id="p0105">‘translated’: the amino-acid translation of the matrix is used.</p>
                     </list-item>
                  </list>
               </p>
            </sec>
            <sec>
               <p id="p0110">The maximum likelihood analysis of the original nucleotide protein-coding data was replicated using a partitioning of the model by which parameters of the GTR + I + Γ model were estimated independently for each codon position, branch lengths being optimized jointly on the three partitions (‘unrecoded_p’ analytical strategy). This also is expected to provide more accurate results.</p>
            </sec>
            <sec>
               <p id="p0115">More sophisticated molecular evolution models were used, in order to take into account some aspects of composition heterogeneity. The CAT model implemented in Phylobayes (<xref rid="bib0075" ref-type="bibr">Lartillot and Philippe, 2004</xref>) allows different sites to evolve under different nucleotide equilibrium frequencies. The NDCH model implemented in P4 (<xref rid="bib0050" ref-type="bibr">Foster, 2004</xref>) allows different branches of the tree to evolve under different nucleotide equilibrium frequencies.</p>
            </sec>
            <sec>
               <p id="p0120">The analyses using Phylobayes version 3.2f (named ‘CAT’) were performed as follows: Two Markov chain Monte Carlo (MCMC) were run under a GTR + I + Γ model, with the automatic stopping criterion based on the computation of convergence statistics between two chains (‘maxdiff’ &gt; 0.3 and ‘effective size’ &gt; 50, checking every 100 cycles).</p>
            </sec>
            <sec>
               <p id="p0125">The analyses using P4 versions 0.88.r190 and 0.88.r186 (named ‘NDCH’) were performed as follows: Four Metropolis coupled Markov chain Monte Carlo (MCMCMC) were run under a GTR + I + Γ model until the log likelihood values of the cold chain appeared to have plateaued and the ESS sampling values were higher than 50 (preferably at least 200, but that was not always achieved). A posterior predictive distribution (<xref rid="bib0005" ref-type="bibr">Bollback, 2002</xref>) of the X<sup>2</sup> composition heterogeneity statistic was generated using data simulated on the Markov chain samples (<xref rid="bib0050" ref-type="bibr">Foster, 2004</xref>). This distribution was compared to the X<sup>2</sup> of the empirical data to evaluate the ability of the model to account for composition heterogeneity across the tree. This comparison was implemented as a one-tailed area probability test (with a <italic>p</italic>-value deemed significant if smaller than 0.05). As long as the model did generate data significantly less heterogeneous than the empirical data, additional composition vectors were added and new analyses were performed under a GTR + I + Γ + <italic>n</italic>CV model (where <italic>n</italic> is the total number of composition vectors), using the same procedure as above. In practice, two composition vectors proved enough for every analysed dataset. The results of the ‘NDCH’ analyses therefore correspond to Bayesian analyses under a GTR + I + Γ + 2CV model.</p>
            </sec>
            <sec>
               <p id="p0130">In order to achieve better comparability with bootstrap analyses, for both the ‘CAT’ and ‘NDCH’ analytical strategies, the coherence of the results was measured using a random selection of 200 post-burnin samples from the Markov chain.</p>
            </sec>
            <sec>
               <p id="p0135">In addition, three other analytical strategies that were expected to have a worse performance than the ‘unrecoded’ maximum likelihood strategy were applied: a parsimony analysis (‘pars’) and two distance-based approaches, one using Jukes–Cantor distances (‘JCdist’) and the other LogDet distances (‘logdet’, <xref rid="bib0095" ref-type="bibr">Lockhart et al., 1994</xref>). All three strategies were performed using tools from the version 3.69 of the phylip package (<xref rid="bib0045" ref-type="bibr">Felsenstein, 2005</xref>) with 200 bootstrap pseudo-replicates of the data.</p>
            </sec>
         </sec>
         <sec id="sec0025">
            <label>2.3</label>
            <title id="sect0045">Coherence measure</title>
            <sec>
               <p id="p0140">The coherence of the results obtained by a given analytical strategy across a series of datasets could be evaluated in various ways, and the design of a method to accomplish such an evaluation could be a research topic in its own right. In the present work, the coherence was evaluated on the basis of pairwise topological distances between trees obtained by applying the analytical strategy on the datasets. The shorter the topological distances, the more ‘similar’ the trees, the more coherent the results.</p>
            </sec>
            <sec>
               <p id="p0145">The Robinson–Foulds symmetric difference (<xref rid="bib0120" ref-type="bibr">Robinson and Foulds, 1981</xref>) was used as topological distance. If we note <italic>A</italic> and <italic>B</italic> the sets of bipartitions defined by the branches of two trees on a same set of leaves, the Robinson–Foulds distance between these two trees is the number of bipartitions present in <italic>A</italic> or in <italic>B</italic> but not in both (see <xref rid="fig0005" ref-type="fig">Fig. 1</xref>).</p>
            </sec>
            <sec>
               <p id="p0150">The coherence of a given analytical strategy can be evaluated using the Robinson–Foulds distances between pairs of trees obtained by applying that strategy on different datasets sharing the same set of leaves. If the sets of leaves differ between datasets, the trees have to be reduced to the set of common leaves. This might raise important issues that are beyond the scope of the present paper. Here, the four datasets delimited by concaterpillar included the same terminal taxa.</p>
            </sec>
            <sec>
               <p id="p0155">In the present work, the analytical strategies included the generation of 200 bootstrap trees or the extraction of 200 samples from a Markov chain. The coherence was therefore evaluated using the distances between pairs of consensus trees (4 consensuses of 200 trees because there are 4 datasets, which makes 6 pairs) and the full distribution of the distances between pairs of bootstrap or MCMC-sampled trees (200<sup>2</sup> = 40 000 distances for a pair of datasets, 6 pairs of datasets).</p>
            </sec>
         </sec>
      </sec>
      <sec id="sec0030">
         <label>3</label>
         <title id="sect0050">Results</title>
         <sec>
            <p id="p0160">The coherence measures are reported in <xref rid="fig0010" ref-type="fig">Fig. 2</xref>.</p>
         </sec>
         <sec>
            <p id="p0165">A striking observation is that the analytical strategies with the highest coherence are those where the signal corresponding to synonymous substitutions at third codon positions has been eliminated (the names of these strategies end in ‘3’) and the strategy using translated data (by which an important part of the variability corresponding to third codon positions is eliminated due to codon synonymy). The next most coherent strategy is the ‘NDCH’ strategy, that models the existence of composition heterogeneities across the phylogeny.</p>
         </sec>
         <sec>
            <p id="p0170">All of these strategies appear more coherent than the reference ‘unrecoded’ strategy, where all signal is present in the data and a simple composition-homogeneous GTR + I + Γ model is used.</p>
         </sec>
         <sec>
            <p id="p0175">This difference in performance is likely to reflect a real difference in reconstruction accuracy. Indeed, both the suppression of the signal associated with synonymy at third codon position and the modelling of composition heterogeneities can reduce the risks of obtaining reconstruction artefacts driven by convergences in genome composition biases, which we know affect the data used here (<xref rid="bib0090" ref-type="bibr">Li et al., in preparation</xref>). The recoding strategies leaving third positions unaffected do not seem to lead to a higher coherence than the ‘unrecoded’ strategy.</p>
         </sec>
         <sec>
            <p id="p0180">Contrary to what was expected, the analytical strategy where the model is partitioned by codon position (‘unrecoded_p’) does not seem to perform strikingly better than its non-partitioned counterpart (‘unrecoded’). The average <italic>RF</italic> distance between bootstrap consensuses is only slightly lower for ‘unrecoded_p’ than for ‘unrecoded’.</p>
         </sec>
         <sec>
            <p id="p0185">As expected, the analytical strategies using parsimony (‘pars’) and distance (‘JCdist’ and ‘logdet’) seem to produce results overall less coherent than the ‘unrecoded’ strategy. It was however not expected that the strategy using a site-heterogeneous model (‘CAT’) would also produce less coherent results than the ‘unrecoded’ strategy. It should be noted that the statistical significance of this observation is not known. It may be that the ‘CAT’ strategy reveals true divergences between the evolutionary histories of the different dataset (see Section <xref rid="sec0035" ref-type="sec">4</xref>).</p>
         </sec>
         <sec>
            <p id="p0190">The ‘NDCH’ strategy used the program P4, which can produce trees with polytomies during the Markov chain. A tree with polytomies has less bipartitions than a fully bifurcating tree. The lower the numbers of bipartitions there are in a pair of trees, the lowest the upper bound on the <italic>RF</italic> distance between these trees. Indeed <italic>RF</italic> is the highest when the trees have no bipartition in common, and it is then equal to the sum of the numbers of bipartitions found in the trees. This may bias the coherence evaluation used in the present work. The same problem holds for the ‘pars’ analytical strategy, because parsimony sometimes results in a set of equally parsimonious trees, and the (polytomous) consensus of these trees is used as the result of the analysis.</p>
         </sec>
         <sec>
            <p id="p0195">This may explain why the ‘NDCH’ strategy had the lowest average <italic>RF</italic> distances between individual sampled trees, although the <italic>RF</italic> distances between consensus trees were not. And similarly, this may explain why the average <italic>RF</italic> distance between individual trees for the ‘pars’ strategy was not the highest, whereas it was the case for the distances between the bootstrap consensus trees for this strategy.</p>
         </sec>
         <sec>
            <p id="p0200">The distances between bootstrapped trees are in average higher than the distances between their consensuses. The bootstrapped trees for a given dataset are by nature dispersed because they are built using different matrices that are samples of the sites from the original alignment that possibly support diverging topologies, but their consensus is expected to reflect the signal present in the full alignment. If all datasets bear the same historical structure, and provided that this structure is efficiently revealed by the analytical strategy, a better coherence is expected between bootstrap consensuses than between the diverse bootstrapped trees.</p>
         </sec>
         <sec>
            <p id="p0205">This discrepancy in coherence between individual trees and between their consensuses is not observed in the case of MCMC-sampled trees (‘NDCH’ and ‘CAT’). The trees sampled from a stationary MCMC are all supposed to be drawn from the a posteriori probability distribution of trees given the full data matrix. If this distribution is centered around one highly dominant most probable topology, the topological dispersion observed within the sample will be low. This is often observed: be it a true characteristic of Bayesian posterior distributions in phylogeny, or because of prevalent inefficiencies in MCMC mixing (<xref rid="bib0070" ref-type="bibr">Lakner et al., 2008</xref>), consensus trees obtained from MCMC samples often have high node posterior probabilities (<xref rid="bib0030" ref-type="bibr">Douady et al., 2003</xref>).</p>
         </sec>
      </sec>
      <sec id="sec0035">
         <label>4</label>
         <title id="sect0055">Discussion</title>
         <sec>
            <p id="p0210">The most accurate strategies should extract more historical signal than the others. Consequently, independent datasets issued from a same evolutionary history should produce more similar trees when using these strategies than when using less accurate analytical strategies (<xref rid="fig0015" ref-type="fig">Fig. 3</xref>a). This seems to be the case to some extent with our data, because the analytical strategies that are designed to counter the misleading effects of convergence in composition bias (‘degen*3’, ‘translated’, and ‘NDCH’) tend to produce more coherent topologies than the other strategies. If the accuracy improvement brought by these strategies is important enough, the hypothesis can be made that their use in concaterpillar (instead of GTR + I + Γ maximum likelihood analyses on unrecoded data) could yield a smaller number of combined datasets, due to a higher compatibility between the phylogenies supported by the individual protein-coding genes (and their combinations) under strategies more apt to overcome reconstruction artefacts.</p>
         </sec>
         <sec>
            <p id="p0215">The coherence-based <italic>a posteriori</italic> approach will probably not perform well when the different datasets were generated on different histories. A good analytical strategy should produce coherent results only within a set of datasets sharing the same history. Otherwise, result dispersion is expected. It is difficult to tell to what extent such dispersion will be strong relative to dispersion due to the use of inaccurate strategies. Such strategies might then be difficult to distinguish from more accurate ones (<xref rid="fig0015" ref-type="fig">Fig. 3</xref>b). The surprisingly high relative incoherence obtained using the ‘CAT’ strategy may be due to the existence of conflicts between datasets that can only be revealed when heterogeneity of composition across sites is taken into account. Such conflicts are not unexpected, due to the bacterial nature of the taxa used for the present tests. The type of pitfall described in <xref rid="fig0015" ref-type="fig">Fig. 3</xref>b may be less a problem with organisms not so prone to horizontal transfers of genetic material.</p>
         </sec>
         <sec>
            <p id="p0220">But even in cases where datasets all result from the same history, a potential pitfall exists that depends in the way the less accurate analytical strategies are inaccurate. If they are inaccurate in a systematically biased way, then the trees may be similar due to shared wrongly inferred relationships (<xref rid="fig0015" ref-type="fig">Fig. 3</xref>c). Including ‘control’ analytical strategies chosen for their known sensitivity to some specific systematically biased artefacts could help the detection of false positives due to these biases. If a control analytical strategy happens to produce more coherent results than other strategies a priori more likely to be accurate, this is a hint that the reconstruction artefact can affect some analyses, and that the coherence criterion should be considered with caution.</p>
         </sec>
         <sec>
            <p id="p0225">The coherence measure used here is based on <italic>RF</italic> distances, which makes it possibly biased because analytical strategies producing less resolved trees might never look as incoherent as strategies producing fully bifurcating trees. However, it is not sure that this is an undesired property. Indeed, one might want to favour analytical strategies that are conservative and avoid displaying relationships not well enough supported.</p>
         </sec>
      </sec>
      <sec id="sec0040">
         <label>5</label>
         <title id="sect0060">Conclusion</title>
         <sec>
            <p id="p0230">The coherence-based <italic>a posteriori</italic> approach tested here seems to behave partially as expected. Some improvements can probably be made in the way coherence is measured, but the potential pitfalls to which it might be sensitive seem difficult to rule out. The presented approach may not be suitable as a good selection tool in practice. Different analytical strategies may be equally coherent but for different (bad or good) reasons. It may therefore be advisable to use such a method as an exploratory tool rather than as a decision tool.</p>
         </sec>
      </sec>
   </body>
   <back>
      <ack>
         <title id="sect0065">Acknowledgements</title>
         <p id="p0235">Thanks to João Sollari Lopes and Cymon J. Cox for assembling and providing the datasets that were used in this study. Thanks to Michel Laurin and the SFS for giving me the opportunity to present these ideas at the ‘Journées d’automne 2012 de la SFS’. Thanks to the editor and reviewers for their comments and advice that hopefully helped me improving the present paper.</p>
         <p id="p0240">This work was supported by a <funding-source id="gs0005">
               <institution-wrap>
                  <institution>Fundação para a Ciência e a Tecnologia (FCT, Portugal)</institution>
               </institution-wrap>
            </funding-source> grant to Cymon J. Cox, Centro de Ciências do Mar (CCMAR) – CIMAR-Lab. Assoc., (PTDC/BIA-BCM/099565/2008).</p>
      </ack>
      <app-group>
         <app>
            <sec id="sec0050">
               <label>Appendix A</label>
               <title id="sect0075">Supplementary data</title>
               <sec>
                  <p id="p0250">The following is the supplementary data to this article: <supplementary-material xmlns:xlink="http://www.w3.org/1999/xlink" id="mmc0005" xlink:href="main.assets/mmc1.pdf"/>
                  </p>
               </sec>
            </sec>
         </app>
      </app-group>
      <ref-list>
         <ref id="bib0005">
            <label>Bollback, 2002</label>
            <element-citation id="sbref0005" publication-type="article">
               <name>
                  <surname>Bollback</surname>
                  <given-names>J.P.</given-names>
               </name>
               <article-title>Bayesian model adequacy and choice in phylogenetics</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>19</volume>
               <year>2002</year>
               <page-range>1171–1180</page-range>
            </element-citation>
         </ref>
         <ref id="bib0010">
            <label>Brinkmann and Philippe, 1999</label>
            <element-citation id="sbref0010" publication-type="article">
               <name>
                  <surname>Brinkmann</surname>
                  <given-names>H.</given-names>
               </name>
               <name>
                  <surname>Philippe</surname>
                  <given-names>H.</given-names>
               </name>
               <article-title>Archaea sister group of bacteria? Indications from tree reconstruction artifacts in ancient phylogenies</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>16</volume>
               <year>1999</year>
               <page-range>817–825</page-range>
            </element-citation>
         </ref>
         <ref id="bib0015">
            <label>Chen et al., 2003</label>
            <element-citation id="sbref0015" publication-type="article">
               <name>
                  <surname>Chen</surname>
                  <given-names>W.J.</given-names>
               </name>
               <name>
                  <surname>Bonillo</surname>
                  <given-names>C.</given-names>
               </name>
               <name>
                  <surname>Lecointre</surname>
                  <given-names>G.</given-names>
               </name>
               <article-title>Repeatability of clades as a criterion of reliability: a case study for molecular phylogeny of Acanthomorpha (Teleostei) with larger number of taxa</article-title>
               <source>Mol. Phylogenet. Evol</source>
               <volume>26</volume>
               <year>2003</year>
               <page-range>262–288</page-range>
            </element-citation>
         </ref>
         <ref id="bib0020">
            <label>Cox et al., 2008</label>
            <element-citation id="sbref0020" publication-type="article">
               <name>
                  <surname>Cox</surname>
                  <given-names>C.J.</given-names>
               </name>
               <name>
                  <surname>Foster</surname>
                  <given-names>P.G.</given-names>
               </name>
               <name>
                  <surname>Hirt</surname>
                  <given-names>R.P.</given-names>
               </name>
               <name>
                  <surname>Harris</surname>
                  <given-names>S.R.</given-names>
               </name>
               <name>
                  <surname>Embley</surname>
                  <given-names>M.T.</given-names>
               </name>
               <article-title>The archaebacterial origin of eukaryotes</article-title>
               <source>Proc. Natl. Acad. Sci. U.S.A.</source>
               <volume>105</volume>
               <year>2008</year>
               <page-range>20356–20361</page-range>
            </element-citation>
         </ref>
         <ref id="bib0025">
            <label>Dettai and Lecointre, 2004</label>
            <element-citation id="sbref0025" publication-type="article">
               <name>
                  <surname>Dettai</surname>
                  <given-names>A.</given-names>
               </name>
               <name>
                  <surname>Lecointre</surname>
                  <given-names>G.</given-names>
               </name>
               <article-title>In search of nothothenioid (Teleostei) relatives</article-title>
               <source>Antarct. Sci.</source>
               <volume>16</volume>
               <year>2004</year>
               <page-range>71–85</page-range>
            </element-citation>
         </ref>
         <ref id="bib0030">
            <label>Douady et al., 2003</label>
            <element-citation id="sbref0030" publication-type="article">
               <name>
                  <surname>Douady</surname>
                  <given-names>C.</given-names>
               </name>
               <name>
                  <surname>Delsuc</surname>
                  <given-names>F.</given-names>
               </name>
               <name>
                  <surname>Boucher</surname>
                  <given-names>Y.</given-names>
               </name>
               <name>
                  <surname>Doolittle</surname>
                  <given-names>F.</given-names>
               </name>
               <name>
                  <surname>Douzery</surname>
                  <given-names>E.</given-names>
               </name>
               <article-title>Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>20</volume>
               <year>2003</year>
               <page-range>248–254</page-range>
            </element-citation>
         </ref>
         <ref id="bib0035">
            <label>Felsenstein, 1981</label>
            <element-citation id="sbref0035" publication-type="article">
               <name>
                  <surname>Felsenstein</surname>
                  <given-names>J.</given-names>
               </name>
               <article-title>Evolutionary trees from DNA sequences: a maximum likelihood approach</article-title>
               <source>J. Mol. Evol.</source>
               <volume>17</volume>
               <year>1981</year>
               <page-range>368–376</page-range>
            </element-citation>
         </ref>
         <ref id="bib0040">
            <label>Felsenstein, 1985</label>
            <element-citation id="sbref0040" publication-type="article">
               <name>
                  <surname>Felsenstein</surname>
                  <given-names>J.</given-names>
               </name>
               <article-title>Confidence limits on phylogenies: an approach using the bootstrap</article-title>
               <source>Evolution</source>
               <volume>39</volume>
               <year>1985</year>
               <page-range>783–791</page-range>
            </element-citation>
         </ref>
         <ref id="bib0045">
            <label>Felsenstein, 2005</label>
            <element-citation id="sbref0045" publication-type="book">
               <name>
                  <surname>Felsenstein</surname>
                  <given-names>J.</given-names>
               </name>
               <source>PHYLIP. Phylogeny Inference Package, Version 3.6</source>
               <year>2005</year>
               <publisher-name>Department of Genome Sciences and Departement of Biology, University of Washington</publisher-name>
               <publisher-loc>Seattle</publisher-loc>
            </element-citation>
         </ref>
         <ref id="bib0050">
            <label>Foster, 2004</label>
            <element-citation id="sbref0050" publication-type="article">
               <name>
                  <surname>Foster</surname>
                  <given-names>P.G.</given-names>
               </name>
               <article-title>Modeling compositional heterogeneity</article-title>
               <source>Syst. Biol.</source>
               <volume>53</volume>
               <year>2004</year>
               <page-range>485–495</page-range>
            </element-citation>
         </ref>
         <ref id="bib0055">
            <label>Goremykin et al., 2010</label>
            <element-citation id="sbref0055" publication-type="article">
               <name>
                  <surname>Goremykin</surname>
                  <given-names>V.V.</given-names>
               </name>
               <name>
                  <surname>Nikiforova</surname>
                  <given-names>S.V.</given-names>
               </name>
               <name>
                  <surname>Bininda-Edmonds</surname>
                  <given-names>O.R.P.</given-names>
               </name>
               <article-title>Automated removal of noisy data in phylogenomic analyses</article-title>
               <source>J. Mol. Evol.</source>
               <volume>71</volume>
               <year>2010</year>
               <page-range>319–331</page-range>
            </element-citation>
         </ref>
         <ref id="bib0060">
            <label>Hassanin et al., 2005</label>
            <element-citation id="sbref0060" publication-type="article">
               <name>
                  <surname>Hassanin</surname>
                  <given-names>A.</given-names>
               </name>
               <name>
                  <surname>Léger</surname>
                  <given-names>N.</given-names>
               </name>
               <name>
                  <surname>Deutsch</surname>
                  <given-names>J.</given-names>
               </name>
               <article-title>Evidence for multiple reversals of asymmetric mutational constraints during the evolution of the mitochondrial genome of Metazoa, and consequences for phylogenetic inferences</article-title>
               <source>Syst. Biol.</source>
               <volume>54</volume>
               <year>2005</year>
               <page-range>277–298</page-range>
            </element-citation>
         </ref>
         <ref id="bib0065">
            <label>Inagaki et al., 2004</label>
            <element-citation id="sbref0065" publication-type="article">
               <name>
                  <surname>Inagaki</surname>
                  <given-names>Y.</given-names>
               </name>
               <name>
                  <surname>Simpson</surname>
                  <given-names>A.G.B.</given-names>
               </name>
               <name>
                  <surname>Dacks</surname>
                  <given-names>J.B.</given-names>
               </name>
               <name>
                  <surname>Roger</surname>
                  <given-names>A.J.</given-names>
               </name>
               <article-title>Phylogenetic artifacts can be caused by leucine, serine and arginine codon usage heterogeneity: Dinoflagellate plastid origins as a case study</article-title>
               <source>Syst. Biol.</source>
               <volume>53</volume>
               <year>2004</year>
               <page-range>582–593</page-range>
            </element-citation>
         </ref>
         <ref id="bib0070">
            <label>Lakner et al., 2008</label>
            <element-citation id="sbref0070" publication-type="article">
               <name>
                  <surname>Lakner</surname>
                  <given-names>C.</given-names>
               </name>
               <name>
                  <surname>van der Mark</surname>
                  <given-names>P.</given-names>
               </name>
               <name>
                  <surname>Huelsenbeck</surname>
                  <given-names>J.P.</given-names>
               </name>
               <name>
                  <surname>Larget</surname>
                  <given-names>B.</given-names>
               </name>
               <name>
                  <surname>Ronquist</surname>
                  <given-names>F.</given-names>
               </name>
               <article-title>Efficiency of Markov chain Monte Carlo tree proposals in Bayesian phylogenetics</article-title>
               <source>Syst. Biol.</source>
               <volume>57</volume>
               <year>2008</year>
               <page-range>86–103</page-range>
            </element-citation>
         </ref>
         <ref id="bib0075">
            <label>Lartillot and Philippe, 2004</label>
            <element-citation id="sbref0075" publication-type="article">
               <name>
                  <surname>Lartillot</surname>
                  <given-names>N.</given-names>
               </name>
               <name>
                  <surname>Philippe</surname>
                  <given-names>H.</given-names>
               </name>
               <article-title>A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>21</volume>
               <year>2004</year>
               <page-range>1095–1109</page-range>
            </element-citation>
         </ref>
         <ref id="bib0080">
            <label>Leigh et al., 2008</label>
            <element-citation id="sbref0080" publication-type="article">
               <name>
                  <surname>Leigh</surname>
                  <given-names>J.W.</given-names>
               </name>
               <name>
                  <surname>Susko</surname>
                  <given-names>E.</given-names>
               </name>
               <name>
                  <surname>Baumgartner</surname>
                  <given-names>M.</given-names>
               </name>
               <name>
                  <surname>Roger</surname>
                  <given-names>A.J.</given-names>
               </name>
               <article-title>Testing congruence in phylogenomic analysis</article-title>
               <source>Syst. Biol.</source>
               <volume>57</volume>
               <year>2008</year>
               <page-range>104–115</page-range>
            </element-citation>
         </ref>
         <ref id="bib0085">
            <label>Li and Lecointre, 2009</label>
            <element-citation id="sbref0085" publication-type="article">
               <name>
                  <surname>Li</surname>
                  <given-names>B.</given-names>
               </name>
               <name>
                  <surname>Lecointre</surname>
                  <given-names>G.</given-names>
               </name>
               <article-title>Formalizing reliability in the taxonomic congruence approach</article-title>
               <source>Zool. Scr.</source>
               <volume>38</volume>
               <year>2009</year>
               <page-range>101–112</page-range>
            </element-citation>
         </ref>
         <ref id="bib0090">
            <label>Li et al., in preparation</label>
            <mixed-citation id="oref0090">Li, B., Sollari Lopes, J., Foster, P.G., Embley, T.M., Cox, C.J. The origin of plastids: a case study of phylogenetic conflict among protein-coding genes and their proteins resulting from compositional biases at synonymous substitution sites (in preparation).</mixed-citation>
         </ref>
         <ref id="bib0095">
            <label>Lockhart et al., 1994</label>
            <element-citation id="sbref0095" publication-type="article">
               <name>
                  <surname>Lockhart</surname>
                  <given-names>P.J.</given-names>
               </name>
               <name>
                  <surname>Steel</surname>
                  <given-names>M.A.</given-names>
               </name>
               <name>
                  <surname>Hendy</surname>
                  <given-names>M.D.</given-names>
               </name>
               <name>
                  <surname>Penny</surname>
                  <given-names>D.</given-names>
               </name>
               <article-title>Recovering evolutionary trees under a more realistic model of sequence evolution</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>11</volume>
               <year>1994</year>
               <page-range>605–612</page-range>
            </element-citation>
         </ref>
         <ref id="bib0100">
            <label>Miyamoto and Fitch, 1995</label>
            <element-citation id="sbref0100" publication-type="article">
               <name>
                  <surname>Miyamoto</surname>
                  <given-names>M.</given-names>
               </name>
               <name>
                  <surname>Fitch</surname>
                  <given-names>W.</given-names>
               </name>
               <article-title>Testing species phylogenies and phylogenetic methods with congruence</article-title>
               <source>Syst. Biol.</source>
               <volume>44</volume>
               <year>1995</year>
               <page-range>64–75</page-range>
            </element-citation>
         </ref>
         <ref id="bib0105">
            <label>Miyamoto et al., 1994</label>
            <element-citation id="sbref0105" publication-type="article">
               <name>
                  <surname>Miyamoto</surname>
                  <given-names>M.M.</given-names>
               </name>
               <name>
                  <surname>Allard</surname>
                  <given-names>M.W.</given-names>
               </name>
               <name>
                  <surname>Adkins</surname>
                  <given-names>R.M.</given-names>
               </name>
               <name>
                  <surname>Janecek</surname>
                  <given-names>L.L.</given-names>
               </name>
               <name>
                  <surname>Honeycutt</surname>
                  <given-names>R.L.</given-names>
               </name>
               <article-title>A congruence test of reliability using linked mitochondrial DNA sequences</article-title>
               <source>Syst. Biol.</source>
               <volume>43</volume>
               <year>1994</year>
               <page-range>236–249</page-range>
            </element-citation>
         </ref>
         <ref id="bib0110">
            <label>Nabholz et al., 2011</label>
            <element-citation id="sbref0110" publication-type="article">
               <name>
                  <surname>Nabholz</surname>
                  <given-names>B.</given-names>
               </name>
               <name>
                  <surname>Künstner</surname>
                  <given-names>A.</given-names>
               </name>
               <name>
                  <surname>Wang</surname>
                  <given-names>R.</given-names>
               </name>
               <name>
                  <surname>Jarvis</surname>
                  <given-names>E.D.</given-names>
               </name>
               <name>
                  <surname>Ellegren</surname>
                  <given-names>H.</given-names>
               </name>
               <article-title>Dynamic evolution of base composition: causes and consequences in avian phylogenomics</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>28</volume>
               <year>2011</year>
               <page-range>2197–2210</page-range>
            </element-citation>
         </ref>
         <ref id="bib0115">
            <label>Posada, 2008</label>
            <element-citation id="sbref0115" publication-type="article">
               <name>
                  <surname>Posada</surname>
                  <given-names>D.</given-names>
               </name>
               <article-title>jModelTest: phylogenetic model averaging</article-title>
               <source>Mol. Biol. Evol</source>
               <volume>25</volume>
               <year>2008</year>
               <page-range>1253–1256</page-range>
            </element-citation>
         </ref>
         <ref id="bib0120">
            <label>Robinson and Foulds, 1981</label>
            <element-citation id="sbref0120" publication-type="article">
               <name>
                  <surname>Robinson</surname>
                  <given-names>D.F.</given-names>
               </name>
               <name>
                  <surname>Foulds</surname>
                  <given-names>L.R.</given-names>
               </name>
               <article-title>Comparison of phylogenetic trees</article-title>
               <source>Math. Biosci.</source>
               <volume>53</volume>
               <year>1981</year>
               <page-range>131–147</page-range>
            </element-citation>
         </ref>
         <ref id="bib0125">
            <label>Rota-Stabelli et al., 2013</label>
            <element-citation id="sbref0125" publication-type="article">
               <name>
                  <surname>Rota-Stabelli</surname>
                  <given-names>O.</given-names>
               </name>
               <name>
                  <surname>Lartillot</surname>
                  <given-names>N.</given-names>
               </name>
               <name>
                  <surname>Philippe</surname>
                  <given-names>H.</given-names>
               </name>
               <name>
                  <surname>Pisani</surname>
                  <given-names>D.</given-names>
               </name>
               <article-title>Serine codon-usage bias in deep phylogenomics: pancrustacean relationships as a case study</article-title>
               <source>Syst. Biol.</source>
               <volume>62</volume>
               <year>2013</year>
               <page-range>121–133</page-range>
            </element-citation>
         </ref>
         <ref id="bib0130">
            <label>Roure and Philippe, 2011</label>
            <element-citation id="sbref0130" publication-type="article">
               <name>
                  <surname>Roure</surname>
                  <given-names>B.</given-names>
               </name>
               <name>
                  <surname>Philippe</surname>
                  <given-names>H.</given-names>
               </name>
               <article-title>Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference</article-title>
               <source>BMC Evol. Biol.</source>
               <year>2011</year>
               <page-range>11</page-range>
            </element-citation>
         </ref>
         <ref id="bib0135">
            <label>Simmons, 2012</label>
            <element-citation id="sbref0135" publication-type="article">
               <name>
                  <surname>Simmons</surname>
                  <given-names>M.P.</given-names>
               </name>
               <article-title>Misleading results of likelihood-based phylogenetic analyses in the presence of missing data</article-title>
               <source>Cladistics</source>
               <volume>28</volume>
               <year>2012</year>
               <page-range>208–222</page-range>
            </element-citation>
         </ref>
         <ref id="bib0140">
            <label>Stamatakis, 2006</label>
            <element-citation id="sbref0140" publication-type="article">
               <name>
                  <surname>Stamatakis</surname>
                  <given-names>A.</given-names>
               </name>
               <article-title>RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models</article-title>
               <source>Bioinformatics</source>
               <volume>22</volume>
               <year>2006</year>
               <page-range>2688–2690</page-range>
            </element-citation>
         </ref>
         <ref id="bib0145">
            <label>Yang, 1993</label>
            <element-citation id="sbref0145" publication-type="article">
               <name>
                  <surname>Yang</surname>
                  <given-names>Z.</given-names>
               </name>
               <article-title>Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>10</volume>
               <year>1993</year>
               <page-range>1396–1401</page-range>
            </element-citation>
         </ref>
         <ref id="bib0150">
            <label>Yang and Rannala, 1997</label>
            <element-citation id="sbref0150" publication-type="article">
               <name>
                  <surname>Yang</surname>
                  <given-names>Z.</given-names>
               </name>
               <name>
                  <surname>Rannala</surname>
                  <given-names>B.</given-names>
               </name>
               <article-title>Bayesian phylogenetic inference using DNA sequences: a Markov chain Monte Carlo method</article-title>
               <source>Mol. Biol. Evol.</source>
               <volume>14</volume>
               <year>1997</year>
               <page-range>717–724</page-range>
            </element-citation>
         </ref>
      </ref-list>
   </back>
   <floats-group>
      <fig id="fig0005">
         <label>Fig. 1</label>
         <caption>
            <p id="spar0015">Robinson–Foulds symmetric difference distance (<italic>RF</italic>). <italic>A</italic> and <italic>B</italic> represent the sets of bipartitions defined by the branches of two trees. The grey area is the set of bipartitions that belong to either <italic>A</italic> or <italic>B</italic> but not to both. This set is the symmetric difference of <italic>A</italic> and <italic>B</italic>. The Robinson–Foulds distance between the two trees is the number of elements in this set.</p>
            <p id="spar0030">
               <bold>Fig. 1.</bold>Distance de différence symétrique de Robinson–Foulds (RF). A et B représentent les ensembles de bipartitions définis par les branches de deux arbres. La zone grisée est l'ensemble des bipartitions qui appartiennent, soit à A, soit à B, mais pas aux deux. Cet ensemble est la différence symétrique de A et B. La distance de Robinson–Foulds entre les deux arbres est le nombre d'éléments dans cet ensemble.</p>
         </caption>
         <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="main.assets/gr1.jpg"/>
      </fig>
      <fig id="fig0010">
         <label>Fig. 2</label>
         <caption>
            <p id="spar0020">Coherence measures for the analytical strategies tested in this study. The vertical axis bears units of Robinson–Foulds topological distances (<italic>RF</italic>), that is, numbers of bipartitions that are not shared between a pair of trees. The lower the value, the more similar the trees are. The strategies are sorted according to the mean (blue circles) of the 6 <italic>RF</italic> distances between pairs of consensus trees (red stars). Those on the left are therefore the most coherent, and, presumably, the most accurate. The distribution of distances between individual bootstrapped or MCMC-sampled trees is represented by its 95% range and its mean (in green dots). See the text for explanations about the names of the analytical strategies.</p>
            <p id="spar0035">
               <bold>Fig. 2.</bold> Mesures de cohérence pour les stratégies d'analyse testées dans cette étude. L'axe vertical est gradué en unités de distance topologique de Robinson–Foulds (RF), c'est-à-dire en nombres de bipartitions qui ne sont pas partagées par une paire d'arbres. Plus la valeur est faible, plus les arbres sont similaires. Les stratégies sont ordonnées selon la moyenne (cercles bleus) des six distances de RF entre paires de consensus (étoiles rouges). Celles à gauche sont donc les plus cohérentes et sont présumées les plus exactes. La distribution des distances entre arbres individuels générés par <italic>bootstrap</italic> ou échantillonnés par MCMC est représentée par son intervalle à 95% et sa moyenne (en pointillés verts). Voir le texte pour des explications à propos des noms des stratégies d'analyse.</p>
         </caption>
         <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="main.assets/gr2.jpg"/>
      </fig>
      <fig id="fig0015">
         <label>Fig. 3</label>
         <caption>
            <p id="spar0025">Possible cases for the coherence of results. The symbolic trees on the left represent evolutionary histories. The targets on the right represent the space of results. The datasets in the middle are the product (black arrow) of an evolutionary history, and are used to infer a result (dot on a target whose centre represents the true relationships) according to a given analytical strategy (green or red arrow). The more compact a set of dots, the more coherent the results it represents. (a) The datasets are the products of a same evolutionary history and there is no systematic bias repeatedly affecting the reconstruction. A ‘good’ analytical strategy will generate results close to the true historical relationships. A ‘bad’ analytical strategy will produce more dispersed results. (b) The datasets are the products of different evolutionary histories. A ‘good’ analytical strategy will produce results close to the historical relationships having produced the dataset on which the inference is based. The results will therefore be situated in different areas of the result space. The distribution of the results obtained by a ‘good’ analytical strategy will be less easy to distinguish from the results obtained by a ‘bad’ analytical strategy than when the datasets are the products of a same history. (c) The datasets are the products of a same evolutionary history and the analytical strategy is sensitive to a systematic inference error. The results will tend to concentrate around a zone of the result space corresponding to an artefactual reconstruction. This artefactual reconstruction will therefore correspond to a coherent set of results, and may be mistaken for the true evolutionary history.</p>
            <p id="spar0040">
               <bold>Fig. 3.</bold> Cas de figure possibles de cohérence des résultats. Les arbres symboliques à gauche représentent des histoires évolutives. Les cibles à droite représentent l'espace des résultats. Les jeux de données au milieu sont le produit (flèche noire) d'une histoire évolutive, et sont utilisés pour inférer un résultat (point sur une cible dont le centre représente les vraies relations de parenté) suivant une stratégie d'analyse donnée (flèche verte ou rouge). Plus un ensemble de points est compact, plus les résultats représentés sont cohérents. a : Les jeux de données sont le produit d'une même histoire évolutive et il n'y a pas de biais systématique affectant de manière répétée la reconstruction. Une « bonne » stratégie d'analyse générera des résultats proches des vraies relations de parenté. Une « mauvaise » stratégie d'analyse produira des résultats plus dispersés. b : Les jeux de données sont le produit de différentes histoires évolutives. Une « bonne » stratégie d'analyse produira des résultats proches de l'histoire évolutive ayant produit le jeu de données sur lequel l'inférence est basée. Les résultats seront donc situés dans des zones différentes de l'espace des résultats. La distribution des résultats obtenus par une « bonne » stratégie d'analyse sera plus difficile à distinguer des résultats obtenus par une « mauvaise » stratégie d'analyse que dans le cas où les jeux de données sont le produit d'une même histoire. c : Les jeux de données sont le produit d'une même histoire évolutive et la stratégie d'analyse est sensible à une erreur systématique d'inférence. Les résultats tendront à se concentrer autour d'une zone de l'espace des résultats correspondant à une reconstruction erronée. Cette reconstruction erronée correspondra donc à un ensemble cohérent de résultats, et pourra être considérée par erreur comme l'histoire évolutive réelle.</p>
         </caption>
         <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="main.assets/gr3.jpg"/>
      </fig>
   </floats-group>
</article>